Testing the Use of N-gram Graphs in Summarization Sub-tasks
نویسندگان
چکیده
Within this article, we sketch the set of generic tools we have devised and used within the summarization process and the domain of summary evaluation, focusing on how the tools were used within the TAC 2008 summarization update challenge. The tools have a common underlying theory and provide utility in various aspects of the Natural Language Processing domain. Within this study we elaborate on query expansion, content matching and filtering, redundancy removal as well as summary evaluation.
منابع مشابه
Automatic Summarization from Multiple Documents
This work reports on research conducted on the domain of multi-document summarization using background knowledge. The research focuses on summary evaluation and the implementation of a set of generic use tools for NLP tasks and especially for automatic summarization. Within this work we formalize the n-gram graph representation and its use in NLP tasks. We present the use of n-gram graphs for t...
متن کاملGraph Hybrid Summarization
One solution to process and analysis of massive graphs is summarization. Generating a high quality summary is the main challenge of graph summarization. In the aims of generating a summary with a better quality for a given attributed graph, both structural and attribute similarities must be considered. There are two measures named density and entropy to evaluate the quality of structural and at...
متن کاملMUDOS-NG: Multi-document Summaries Using N-gram Graphs (Tech Report)
This report describes the MUDOS-NG summarization system, which applies a set of language-independent and generic methods for generating extractive summaries. The proposed methods are mostly combinations of simple operators on a generic character n-gram graph representation of texts. This work defines the set of used operators upon n-gram graphs and proposes using these operators within the mult...
متن کاملROUGE 2.0: Updated and Improved Measures for Evaluation of Summarization Tasks
Evaluation of summarization tasks is extremely crucial to determining the quality of machine generated summaries. Over the last decade, ROUGE has become the standard automatic evaluation measure for evaluating summarization tasks. While ROUGE has been shown to be effective in capturing n-gram overlap between system and human composed summaries, there are several limitations with the existing RO...
متن کاملMulti-document summaries using n-gram graphs: salience and redundancy
This paper describes a summarization system that aims to provide a set of languageindependent and generic methods for generating extractive summaries. The proposed methods are realized as operators to a generic character n-gram graph representation of texts, towards the selection of content and removal of redundancy. This work defines the set of generic operators upon n-gram graphs and proposes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008